Coarse - Grained Parallel Algorithms forMulti - Dimensional Wavelet
نویسنده
چکیده
This paper presents parallel algorithms for computing multi-dimensional wavelet transforms on both shared memory and distributed memory machines. Traditional data partitioning methods for n-dimensional Discrete Wavelet Transforms (DWTs) call for data redistribution once a one dimensional wavelet transform is computed along each dimension. To avoid the data communication inherent in this redistribution, two new partitioning methods called CRBP (Communication Reduced Block Partitioning) and CRLP (Communication Reduced Layer Partitioning) are proposed. The eeciency of the algorithms is compared through several examples implemented on a cluster of SGI workstations. Two kinds of parallel approaches are used to compute multi-dimensional wavelet transforms on shared memory machines: homogeneous paral-lelism, and heterogeneous parallelism. Homogeneous parallelism uses traditional data partitioning while heterogeneous parallelism uses the CRBP approach. The eeectiveness of these approaches is demonstrated through several examples implemented on an SGI Power Challenge. The paper discusses the eeectiveness of each of the approaches on the two kinds of architectures.
منابع مشابه
Scalability of 2-d Wavelet Transform Algorithms: Analytical and Experimental Results on Coarse-grained Parallel Computers
We present analytical and experimental results for the scal-ability of 2-D discrete wavelet transform algorithms on coarse-grained parallel architectures. The principal operation in the 2-D DWT is the ltering operation used to implement the lter banks of the 2-D sub-band decomposition. We derive analytical results comparing time domain and frequency domain parallel algorithms for realizing the ...
متن کاملSCALABILITY OF 2-D WAVELET TRANSFORM ALGORITHMS: ANALYTICAL AND EXPERIMENTAL RESULTS ON COARSE-GRAIN - VLSI Signal Processing, IX, 1996., [Workshop on]
We present analytical and experimental results for the scalability of 2-D discrete wavelet transform algorithms on coarse-grained parallel architectures. The principal operation in the 2-D DWT is the filtering operation used to implement the filter banks of the 2-D subband decomposition. We derive analytical results comparing time domain and frequency domain parallel algorithms for realizing th...
متن کاملHeterogeneous acceleration of volumetric JPEG 2000 using OpenCL
This paper discusses an OpenCL version of a volumetric JPEG 2000 codec that runs on GPUs, multi-core processors or a combination of both. Since the performance critical part consists of a fine-grained (discrete wavelet transform) and coarse-grained algorithm (Tier-1), the best performance is obtained with a hybrid execution in which the discrete wavelet transform is executed on a GPU and Tier-1...
متن کاملPractical Algorithms for Selection on Coarse-Grained Parallel Computers
In this paper, we consider the problem of selection on coarse-grained distributed memory parallel computers. We discuss several deterministic and randomized algorithms for parallel selection. Experimental results on the CM5 demonstrate that randomized algorithms are superior to their deterministic counterparts.
متن کاملCoarse grained parallel algorithms for graph matching
Parallel graph algorithm design is a very well studied topic. Many results have been presented for the PRAM model. However, these algorithms are inherently fine grained and experiments show that PRAM algorithms do often not achieve the expected speedup on real machines because of large message overheads. In this paper, we present coarse grained parallel graph algorithms with small message overh...
متن کامل